
SEQUENCE LISTING 



<110> Strathmann AG & Co. 

<120> Virus-Vaccine 

<130> P057760 

<140> 
<141> 

<150> 199 07 485.2 
<151> 1999-02-12 

<160> 12 

<170> PatentIn Ver. 2.1 

<210> 1 
<211> 9709 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 1 

tggaagggcc aatctggtcc caaaaaagac aagagatcct tgacccgcgg atcuaccaca 60 

cacaaggcta cttccctgat tggcagaacu acacaccagg gccagggatc agatatccac 120 

tgacctttgg atggtgcttc aagttagtac cagttgaacc agagcaagta gaagaggcca 180 

aataaggaga gaagaacagc ttgtcacacc ctacgagcca gcatgggacg gaggacccgg 240 

agggagaagt attagtgtgg aagtttgaca gcctcccagc acttcgtcac atggcccgag 300 

agctgcatcc ggagtactac aaagaccgcc gacatcgagc tctccacaag ggactctccg 360 

ctggggactt tccagggagg tgcggcccgg gcgggactgg ggagtggcga gccctcagat 420 

gctacatata agcagctgct ttttgcctgc actgggtctc tctggttaga ccagatctga 480 

gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata aagctcgcct 540 

tgagcgccca aagtagtgtg tgcccgcccg tcgcgtgact c-ggtaacta gagacccctc 600 

agaccct:t:ct agtcagtgtg gaaaatctc: agcagtggcg cccgaacagg gactcgaaag 660 

cgaaagcaaa gccagaggag atctctcgac gcaggactcg gcctgctgaa gcgcgcacgg 720 

caagaggcga ggggcggcga ctggtgagta cgccaaaaat tccgactagc ggaggctaga 780 

aggagagaga tgggtgcgag agcgtcggta ttaagcgggg gagaattaga taaatgggaa 840 

aaaattcggt taaggccagg gggaaagaaa caatataaac taaaacatat agtatgggca 900 

agcagggagc tagaacgatt cgcagttaat cctggccttt tagagacatc agaaggctgt 960 

agacaaatac tgggacagct acaaccaccc cttcagacag gatcagaaga acttagatca 1020 

ttatataata caatagcagt cctctatcgt gcgcaccaaa ggacagatgt aaaagacacc 1080 

aaggaagcct tagataagat agaggaagag caaaacaaaa gtaagaaaaa ggcacagcaa 1140 

gcagcagctg acacaggaaa caacagccag gtcagccaaa attaccctat agtgcagaac 1200 

ctccaggggc aaatggtaca tcaggccata tcacctagaa ctttaaatgc atgggtaaaa 1260 

gcagtagaag agaaggcttt cagcccagaa gtaataccca tgttttcagc attatcagaa 1320 

ggagccaccc cacaagattt aaataccacg ctaaacacag tggggggaca tcaagcagcc 1380 

atgcaaatgt taaaagagac catcaatgag gaagctgcag aatgggatag attgcatcca 1440 

gtgcatgcag ggcctattgc accaggccag atgagagaac caaggggaag tgacacagca 1500 

ggaactacta gtacccttca ggaacaaata ggatggatga cacataatcc acctatccca 1560 

gtaggagaaa tctataaaag acggataatc ctgggattaa ataaaatagt aagaatgtat 1620 

agccctacca gcattctgga cataagacaa ggaccaaagg aaccctttag agactatgta 1680 

gaccgattct ataaaactct aagagccgag caagcttcac aagaggtaaa aaattggatg 1740 

acagaaacct tgttggtcca aaatgcgaac ccagattgta agactatttt aaaagcattg 1800 

ggaccaggag cgacactaga agaaacgacg acagcatgcc agggagtggg gggacccggc 1860 

cataaagcaa gagttttggc tgaagcaacg agccaagtaa caaatccagc taccataatg 1920 



atacagaaag gcaatttcag gaaccaaaga aagactgt-a agcgttccaa ttgcggcaaa 1980
gaagggcaca tagccaaaaa ttgcagggcc cctaggaaaa agggctgttg gaaatgtgga 2040
aaggaaggac accaaatgaa agactgcacc gagagacagg ccaatxtcnt agggaagatc 2100
tggcctcccc acaagggaag gccagggaat cttcttcaga gcagaccaga gccaacagcc 2160
ccaccagaag agagcttcag gtttggggaa gagacaacaa c:ccctccca gaagcaggag 2220
ccgatagaca aggaactgta tcctttagct tccctcagac caccctttgg cagcgacccc 2280
tcgtcacaat aaagataggg gggcaactaa aggaagctcc atragacaca ggagcagatg 2340
atacagtatt agaagaaatg aatttgccag gaagatggaa accaaaaacg atagggggaa 2400
ttggaggttn tatcaaagta ggacagtatg atcagatact: cacagaaatc tgcggacata 2460
aagccatagg tacagtatta gtaggaccta cacctgtcaa cacaattgga agaaatctgt 2520
tgacccagat tggccgcact: ttaaattutc ccattagtcc taccgagact gcaccagtaa 2580
aaccaaagcc aggaatggat ggcccaaaag ttaaacaacg gccactgaca gaagaaaaaa 2640
taaaagcatt agtagaaatt cgtacagaaa tggaaaagga aggaaaaatt tcaaaaattg 2700
ggcctgaaaa tccatacaat actccagtat ttgccacaaa gaaaaaagac agtaccaaat 2760
ggagaaaatt agtagatccc agagaactca ataagagaac tcaagatttc tgggaagtcc 2820
aattaggaat accacatcct gcagggtcaa aacagaaaaa aucagtaaca gtaccggatg 2880
tgggcgatgc atacttttca gttcccttag ataaagactc caggaagtat actgcactta 2940
ccatacctag tataaacaat gagacaccag ggattagata ccagcacaac gtgctcccac 3000
agggatggaa aggatcacca gcaacacccc agtgcagcac gacaaaaatc ttagagcctt 3060
tcagaaaaca aaatccagac atagtcatct atcaacacat ggacgatccg tatgtaggat 3120
ctgactuaga aatagggcag catagaacaa aaacagagga aczgagacaa catctgttga 3180
ggcggggatt taccacacca gacaaaaaac atcagaaaga acctccactc ccttggatgg 3240
gctatgaact ccatcctzgat aaatggacag tacagcccae agzgctgcca gaaaaggaca 3300
gccggactgt caatgacata cagaaatuag tgggaaaatt gaattgggca agtcagattt 3360
atgcagggat taaagtaagg caattatgta aactccttag gggaaccaaa gcactaacag 3420
aagtagtacc actaacagaa gaagcagagc tagaaccggc agaaaacagg gagattctaa 3480
aagaaccggt acatggagtg tattatgacc catcaaaaga cc~aatagca gaaatacaga 3540
agcaggggca aggccaatgg acatatcaaa tttaccaaga gccatttaaa aatctgaaaa 3600
caggaaaaca tgcaagaatg aagggtgccc acactaatga tgcgaaacaa tcaacagagg 3660
cagtacaaaa aatagccaca gaaagcatag taacaugggg aaagactccc aaacccaaat 3720
tacccataca aaaggaaaca tgggaagcat ggtggacaga gcautggcaa gccacctgga 3780
ttcctgagtg ggagcttgtc aatacccctc ccttagcgaa gccacggcac cagttagaga 3840
aagaacccat aataggagca gaaacttcct atgtagatgg ggcagccaac agggaaacta 3900
aattaggaaa agcaggacat gtaactgaca gaggaagaca aaaagttgcc cccctaacgg 3960
acacaacaaa tcagaagact gagttacaag caatccatc- acccttgcag gattcgggat 4020
tagaagtaaa catagtgaca gactcacaat atgcactggg aaccattcaa gcacaaccag 4080
ataagagcga atcagagtzta gccagtcaaa taatagagca gtcaataaaa aaggaaaaag 4140
tctacctggc atgggtacca gcacacaaag gaatcggagg aaatgaacaa gtagatgggt 4200
tggtcagtgc tggaatcagg aaagtactat ttttagatgg aacagataag gcccaagaag 4260
aacatgagaa atatcacagt aattggagag caatggctag tgactttaac ccaccacctg 4320
tagcagcaaa agaaacagta gccagctgtg acaaacgcca gcnaaaaggg gaagccatgc 4380
atggacaagt agactgcagc ccaggaatat ggcagctaga ttgcacacat ttagaaggaa 4440
aagttatctt ggtagcagtt catgtagcca gtggacacat agaagcagaa gcaattccag 4500
cagagacagg gcaagaaaca gcatacttcc tcttaaaatt agcaggaaga tggccagtaa 4560
aaacagtaca tacagacaat ggcagcaatt tcaccagtac tacagctaag gccgcctgtt 4620
gg^gggcggg gaccaagcag gaatttggca ttccctacaa tccccaaagt caaggagtaa 4680
tagaatctat gaataaagaa ttaaagaaaa ttacaggaca ggcaagagat caggctgaac 4740
atcttaagac agcagtacaa atggcagtat tcatccacaa tcttaaaaga aaagggggga 4800
ttggggggta cagtgcaggg gaaagaatag tagacataac agcaacagac acacaaacta 4860
aagaattaca aaaacaaatt acaaaaattc aaaatcttcg ggtttattac agggacagca 4920
gagatccagt ttggaaagga ccagcaaagc tcctctggaa aggcgaaggg gcagtagtaa 4980
tacaagataa tagtgacata aaagtagtgc caagaagaaa agcaaagatc atcagggatt 5040
acggaaaaca gatggcaggt gatgattgtg tggcaagtag acaggatgag gattaacaca 5100
tggaaaagat tagtaaaaca ccatatgtat atttcaagga aagctaagga ctggttttat 5160
agacatcact atgaaagtac taatccaaaa ataagttcag aagtacacat cccactaggg 5220
gacgctaaat tagtaataac aacatattgg ggtctgcata caggagaaag agactggcat 5280
ttgggtcagg gagtctccat agaatggagg aaaaagagat atagcacaca agtagacccc 5340



gacctagcag accaaccaat tcacctgcac cattttgatt gtccttcaga atetgetata 5400
agaaatacca tactaggacg tatagttagt cctaggtgcg aatatcaagc aggacacaac 5460
aaggtaggac ctctacagta cttggcacta gcagcactaa taaaaccaaa acagacaaag 5520
ccacctctgc ctagtgttag gaaactgaca gaggacagac ggaacaagee ccagaagacc 5580
aagggccaca gagggagcca tacaatgaat ggacactaga gcttttagag gaactcaaga 5640
gtgaagctgt tagacatttt cctaggacat ggctccataa cttaggacaa catatctatg 5700
aaacttacgg ggatacttgg gcaggagcgg aagccacaat aagaattccg caacaaccgc 5760
tgtttatcca tttcagaatt gggtgtcgac atagcagaat aggegtcact cgacagagga 5820
gagcaagaaa tggagccagt agatcctaga ctagagccct ggaagcatcc aggaagtcag 5880
cccaaaaccg cttgtaccaa ttgctatcgt aaaaagtgtt gctttcattg ccaagtttgt 5940
ttcatgacaa aagccttagg catctcctat ggcaggaaga ageggagaca gcgacgaaga 6000
gctcaccaga acagtcagac tcatcaagcc tctctatcaa agcagtaagt agtacatgta 6060
atgcaaccta taatagtagc aatagtagca ttagtagtag caataataat agcaatagtt 6120
gtgtggtcca tagtaatcat agaatatagg aaaacattaa gacaaagaaa aatagacagg 6180
ttaattgaca gactaataga aagagcagaa gaeageggea atgagagtga aggagaagta 6240
tcagcacttg tggagatggg ggcggaaatg gggcaccatg ctccttggga tattgatgat 6300
ctgcagtgct acagaaaaat tgtgggtcac agtctattat ggggtacctg tgtggaagga 6360
agcaaccacc actctatttt gtgcatcaga tgetaaagea tatgatacag aggtacataa 6420
tgtttgggcc acacatgcct gtgtacccac agaccccaac ccacaagaag tagtactggt 6480
aaatgtgaca gaaaatttta acaugtggaa aaacgacacg gcagaacaga tgcatgagga 6540
tacaatcagc ttatgggatc aaagcctaaa gccacgtgta aaaccaaccc caccctgtgt 6600
tagtttaaag tgcactgatc tgaagaatga caccaatacc aatagtagta gegggagaac 6660
gataacggag aaaggagaga taaaaaactg ccctcccaat atcagcacaa gcataagaga 6720
taaggtgcag aaagaatatg catcctttea taaacctgat atagtaccaa tagacaatac 6780
cagctacagg ccgataagtc gcaacacccc agtcaccaca caggcctgtc caaaggcatc 6840
ctttgagcca attcccatac attattgege cccggccggt tttgegatte taaaatguaa 6900
taataagacg ttcaatggaa caggaccatg cacaaacgtc agcacagtac aacgcacaca 6960
tggaatcagg ccagtagtat caacccaact getgeeaaat ggcagtctag cagaagaaga 7020
tgtagtaatt agatctgcca acttcacaga caatgecaaa accataatag tacagctgaa 7080
cacatctgta gaaattaatt gtacaagacc caacaacaat acaagaaaaa gtatcegtat 7140
ccagagggga ccagggagag cattngttac aataggaaaa ataggaaata tgagacaagc 7200
acattgtaac attagtagag caaaatggaa tgecacttta aaacagatag ctagcaaatt 7260
aagagaacaa tttggaaata ataaaacaat aatccccaag caaccctcag gaggggaccc 7320
agaaattgta acgcacagtt ttaatcgtgg aggggaattt ttctactgta atccaacaca 7380
actgtctaac agtacttggt ttaatagcac teggagcact gaagggtcaa ataacactga 7440
aggaagtgac acaatcacac tcccatgcag aataaaacaa tttacaaaca tgtggcagga 7500
agtaggaaaa gcaatgtatg ccccccccat cagrggacaa atcagatgtt catcaaatat 7560
tactgggctg ctattaacaa gagacggtgg taataacaac aatgggtccg agatcttcag 7620
acctggagga ggcgatacga gggacaattg gagaagtgaa ttatataaat ataaagtagt 7680
aaaaattgaa ccattaggag tagcacccac caaggcaaag agaagagtgg tgcagagaga 7740
aaaaagagca gtgggaatag gagctttgtt cctcgggttc ttgggagcag caggaagcac 7800
tatgggctgc acgtcaatga cgctgacggt acaggecaga caattattgt ctgatatagt 7860
gcagcagcag aacaatttgc tgagggctat tgaggcgcaa cagcatctgt tgcaacccac 7920
agtctggggc atcaaacagc tccaggcaag aatcccggct geggaaagat acctaaagga 7980
tcaacagctc ctggggattt ggggutgetc tggaaaactc atttgeacca ctgccgtgcc 8040
ttggaatgct agttggagta ataaatctcc ggaacagatt tggaataaca tgacctggat 8100
ggagtgggac agagaaatta acaattacac aagcttaata caccccttaa ttgaagaatc 8160
gcaaaaccag caagaaaaga atgaacaaga attaceggaa ttagataaat gggcaagttt 8220
gtggaattgg tttaacataa caaactggct geggnatata aaactattca taatgatagt 8280
aggaggctcg gtaggtttaa gaatagtttt tgctgtactt tccataguga atagagttag 8340
gcagggatat tcaccattat cgtttcagac ccacccccca atcccgaggg gacccgacag 8400
gcccgaagga atagaagaag aaggtggaga gagagacaga gacagatcca ttcgactagt 8460
gaacggatcc ttagcactta tctgggacga tetgeggage ctgtgcctct tcagctacca 8520
ccgcttgaga gacttactct tgattgtaac gaggategtg gaacttctgg gaegcagggg 8580
gtgggaagcc ctcaaatatt ggtggaatct cctacagtat tggagtcagg aactaaagaa 8640
tagtgctgtt aacttgctca atgccacagc catagcagta gctgagggga cagatagggt 8700
tacagaagta ttacaagcag cttatagagc tattcgccac atacctagaa gaataagaca 8760



gggcttggaa aggattttgc tataagatgg gtggcaagtg gecaaaaage agtgtgattg 8820
gatggcctgc tgtaagggaa agaatgagac gagetgagec agcagcagat ggggtgggag 8880
cagtatctcg agacctagaa aaacatggag caatcacaag tagcaataca gcagctaaca 8940
atgctgctcg tgcctggcta gaagcacaag aggaggaaga ggtgggtttt ccagtcacac 9000
ctcaggtacc tttaagacca atgacttaca aggcagctgc agatcttagc cacttttztaa 9060
aagaaaaggg gggactggaa gggctaattc actcccaaag aagacaagat atccttgatc 9120
tgtggatcta ccacacacaa ggctacttcc ctgattggca gaactacaca ccagggccag 9180
gggtcagata tccactgacc tttggatggt gctacaagct agtaccagtt gagecagata 9240
aggtagaaga ggccaataaa ggagagaaca ccagcttgtt acaccctgtg agectgeatg 9300
gaatggatga ccctgagaga gaagtgttag agtggaggt" tgacagccgc ctagcatttc 9360
atcacgtggc ccgagagctg catccggagt acttcaagaa ctgetgacat egagcttget 9420
acaagggact ttccgctggg gactttccag ggaggcgtgg cctgggcggg actggggagt 9480
ggcgagccct cagatgctgc atataagcag ctgctttttg cctgtactgg gtctctctgg 9540
ttagaccaga tctgagcctg ggagctctct ggctaactag ggaacccact gettaagect 9600
caataaagct tgccttgagt gcttcaagta gtgtgtgccc gtctgttgtg tgactctggt 9660
aactagagat ccctcagacc cttttagtca gtgtggaaaa tctccagca 9709



<210> 2 
<211> 854 
<212> PRT 

<213> Human immunodeficiency virus 
<220> 

<223> Envelope Polyprotein 
<400> 2 

Met Arg Val Lys Glu Lys Tyr Gln His Leu Trp Arg Trp Gly Trp Lys 
1 5 10 15 

Trp Gly Thr Met Leu Leu Gly Ile Leu Met Ile Cys Ser Ala Thr Glu 
20 25 30 

Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala 
35 40 45 

Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu 
50 55 60 

Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn 
65 70 75 80 

Pro Gln Glu Val Val Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp 
85 90 95 

Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp 
100 105 110 

Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Ser 
115 120 125 

Leu Lys Cys Thr Asp Leu Lys Asn Asp Thr Asn Thr Asn Ser Ser Ser 
130 135 140 

Gly Arg Met Ile Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn 
145 150 155 160 

Ile Ser Thr Ser Ile Arg Asp Lys Val Gln Lys Glu Tyr Ala Phe Phe 
165 170 175 



Tyr Lys Leu Asp Ile Val Pro Ile Asp Asn Thr Ser Tyr Arg Leu Ile 
180 185 190 

Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe 
195 200 205 

Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu 
210 215 220 

Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr Asn Val 
225 230 235 240 

Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln 
245 250 255 

Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Asp Val Val Ile Arg Ser 
260 265 270 

Ala Asn Phe Thr Asp Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Thr 
275 280 285 



Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser 
290 295 300 

Ile Arg Ile Gln Arg Gly Pro Gly Arg Ala Phe Val Thr Ile Gly Lys 
305 310 315 320 

Ile Gly Asn Met Arg Gln Ala His Cys Asn Ile Ser Arg Ala Lys Trp 
325 330 335 

Asn Ala Thr Leu Lys Gln Ile Ala Ser Lys Leu Arg Glu Gln Phe Gly 
340 345 350 

Asn Asn Lys Thr Ile Ile Phe Lys Gln Ser Ser Gly Gly Asp Pro Glu 
355 360 365 

Ile Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn 
370 375 380 

Ser Thr Gln Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp Ser Thr 
385 390 395 400 

Glu Gly Ser Asn Asn Thr Glu Gly Ser Asp Thr Ile Thr Leu Pro Cys 
405 410 415 

Arg Ile Lys Gln Phe Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met 
420 425 430 



Tyr Ala Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr 
435 440 445 

Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn Asn Asn Gly Ser Glu 
450 455 460 

Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu 
465 470 475 480 

Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro 
485 490 495 

Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly 
500 505 510 

Ile Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
515 520 525 

Gly Cys Thr Ser Met Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser 
530 535 540 

Asp Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Ile Glu Ala Gln 
545 550 555 560 

Gln His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala 
565 570 575 

Arg Ile Leu Ala Val Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly 
580 585 590 

Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp 
595 600 605 

Asn Ala Ser Trp Ser Asn Lys Ser Leu Glu Gln Ile Trp Asn Asn Met 
610 615 620 



Thr Trp Met Glu Trp Asp Arg Glu Ile Asn Asn Tyr Thr Ser Leu Ile 
625 630 635 640 

His Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln 
645 650 655 

Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn 
660 665 670 

Ile Thr Asn Trp Leu Trp Tyr Ile Lys Leu Phe Ile Met Ile Val Gly 
675 680 685 

Gly Leu Val Gly Leu Arg Ile Val Phe Ala Val Leu Ser Ile Val Asn 
690 695 700 

Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr His Leu Pro 
705 710 715 720 

Ile Pro Arg Gly Pro Asp Arg Pro Glu Gly Ile Glu Glu Glu Gly Gly 
725 730 735 



Glu Arg Asp Arg Asp Arg Ser Ile Arg Leu Val Asn Gly Ser Leu Ala 
740 745 750 

Leu Ile Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ser Tyr His Arg 
755 760 765 

Leu Arg Asp Leu Leu Leu Ile Val Thr Arg Ile Val Glu Leu Leu Gly 
770 775 780 

Arg Arg Gly Trp Glu Ala Leu Lys Tyr Trp Trp Asn Leu Leu Gln Tyr 
785 790 795 800 

Trp Ser Gln Glu Leu Lys Asn Ser Ala Val Asn Leu Leu Asn Ala Thr 
805 810 815 

Ala Ile Ala Val Ala Glu Gly Thr Asp Arg Val Ile Glu Val Leu Gln 
820 825 830 

Ala Ala Tyr Arg Ala Ile Arg His Ile Pro Arg Arg Ile Arg Gln Gly 
835 840 845 

Leu Glu Arg Ile Leu Leu 
850 



<210> 3 
<211> 107 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide for cloning 

<400> 3 

aagatgtagt aattagatct gccaatttca cagacaatgc taaaaccata atagtacagc 60 
tgaacacatc gttagaaatt aattgtacaa gacccaacaa caataca 107 



<210> 4 
<211> 120 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide for cloning 

<220> 

<221> misc_feature 
<222> (97)..(99) 

<223> Sequence at this position: (GA)(AT)(GATC), i.e. 
base at position 97 can be G or A, base at 
position 98 can be A or T, and base at 
position 99 can be G, A, T or C. 

<400> 4 

ttttgctcta gaaatgttac aatgtgcttg tcttatgtct cctgttgcag cttctgttgc 60 
atgaaatgct ctccctggtc cgatatggat actatgrwnt tttcttgtat tgttgttggg 120 






<210> 5 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence 
sequencing primer 

<400> 5 

ccatgtacaa atgtcag 

<210> 6 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence 
sequencing primer 

<400> 6 

aaaactgtgc gttacaa 

<210> 7 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence 
sequencing primer 

<400> 7 

gtaaaacgac ggccagt 



<210> 8 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence 
sequencing primer 

<400> 8 

caggaaacag ctatgac 



<210> 9 
<211> 2148 




<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence: synthetic DNA 



<220> 

<221> misc_feature 
<222> (3)..(9) 
<223> BstEII cleavage site 

<220> 

<221> misc_feature 
<222> (2143)..(2148) 
<223> BamHI cleavage site 



<400> 9 

tgggtcaccg tctattatgg ggtgcctgtg tggaaggaag caaccaccac tctattttgt 60
gcatcagatg ctaaagcata tgatacagag gtacacaatg tttgggccac acatgcctgt 120
gtacccacag accccaaccc acaagaagta gtattggtaa atgtgacaga aaattttaac 180
atgtggaaaa atgacatggt agaacagatg catgaggata taatcagttt atgggatcaa 240
agccttaagc catgtgtaaa attaacccca ctctgtgtta gtttaaagtg cactgatttg 300
aagaatgata ctaataccaa tagtagtacc gggagaatga taatggagaa aggagagata 360
aaaaactgca gcttcaatat cagcacaagc ataagagata aggtgcagaa agaatatgca 420
ttcttttata aacttgatat agtaccaata gataatacca gctataggtt gataagttgt 480
aacacctcag tgatcacaca ggcctgtcca aaggtatcct ttgagccaat tcccatacat 540
tattgtgccc cggctggttt tgcgatccta aaatgtaata ataagacgtt caatggaaca 600
ggaccatgta caaatgtcag cacagtacaa tgtacacatg gaattcgacc agtagtatca 660
actcaactgc tgttaaatgg cagtctagca gaagaagatg tagtaattag atctgccaat 720
ttcacagaca atgctaaaac cataatagta cagctgaaca catctgtaga aattaattgt 780
acaagaccca acaacaatac aagaaaaagt atccgtatcc agaggggacc agggagagca 840
tttgttacaa taggaaaaat aggaaatatg agacaagcac attgtaacat ttctagagca 900
aaatggaatg ccactttaaa acagatagct agcaaattaa gagaacaatt tggaaataat 960
aaaacaataa tctttaagca gtcatccgga ggggacccag aaattgtaac gcacagtttt 1020
aattgtggag gggaattttt ctactgtaat ccaacacaac tgtttaatag tacttggttt 1080
aatagtactt ggagtactga agggtcaaat aacactgaag gaagtgacac aatcacactc 1140
ccatgcagaa taaaacaatt tataaacatg tggcaggaag taggaaaagc aatgtatgcc 1200
cctcccatca gtggccaaat tagatgttca tcaaatatta ctgggctgct attaactcga 1260
gatggtggta ataacaacaa tgggtccgag attttcagac ctggaggagg cgatatgagg 1320
gataattgga gaagtgaatt atataaatat aaagtagtaa aaattgaacc attaggagta 1380
gcacccacca aggcaaagag acgcgtggtg cagagagaaa agcgcgcagt gggaatagga 1440
gctctgttcc ttgggttctt gggagcagca ggaagcacta tgggcgcagc gtcaatgacg 1500
ctgacggtac aggccagaca attattgtct gatatagtgc agcagcagaa caatttgctg 1560
agggcaattg aggcgcaaca gcatctgttg caactcacag tctggggcat caaacagctc 1620
caggcaagaa tcctggctgt ggaaagatac ctaaaggatc aacagctcct ggggatttgg 1680
ggttgctctg gaaaactcat ttgcaccact gctgtgcctt ggaatgctag ttggagtaat 1740
aaatctctgg aacagatttg gaataacatg acctggatgg agtgggacag agaaattaac 1800
aattacacaa gcttaataca ctccttaatt gaagaatcgc aaaaccagca agaaaagaat 1860
gaacaagaat tattggaatt agataaatgg gcaagtttgt ggaattggtt taacataaca 1920
aattggctgt ggtatataaa attattcata atgatagtag gaggcttggt aggtttaaga 1980
atagtttttg ctgtactttc tatagtgaat agagttaggc agggatattc accattatcg 2040
tttcagaccc acctcccaat cccgagggga cccgacaggc ccgaaggaat agaagaagaa 2100
ggtggagaga gagacagaga cagatccatt cgattagtga acggatcc 2148



<210> 10 
<211> 6229 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence: synthetic DNA 
<220> 

<221> sig_peptide 
<222> (1293)..(1295) 
<223> env ATG 

<220> 

<221> misc_feature 
<222> (1377)..(1379) 
<223> env AGT, gp120 start 

<220> 

<221> misc_feature 
<222> (1397)..(1403) 
<223> BstEII cleavage site 

<220> 

<221> misc_feature 
<222> (3537)..(3542) 
<223> BamHI cleavage site 

<220> 

<221> misc_feature 
<222> (3855)..(3857) 
<223> env TAA, stop 



<400> 10 

ctgacgcgcc ctgtagcggc gcattaagcg cggcgggtgc ggtggttacg cgcagcgtga 60
ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg 120
ccacgttcgc cggctccccc cgtcaagctc taaatcgggg gctcccttta gggtnccgat 180
ttagtgcttt acggcacctc gaccccaaaa aacttgatta gggtgatggt tcacgtagtg 240
ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg ttctttaata 300
gtggactctt gttccaaact ggaacaacac tcaaccctat ctcggtctat tcttttgatt 360
tataagggat tttgccgatt tcggcctatt ggttaaaaaa tgagctgatt taacaaaaat 420
ttaacgcgaa ttttaacaaa atattaacgc ttacaatttc cattcgccat tcaggctgcg 480
caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg 540
gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg 600
taaaacgacg gccagtgagc gtctagttat taatagtaat caattacggg gtcattagtt 660
catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc gcctggctga 720
ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat agtaacgcca 780
atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc ccacttggca 840
gtacatcaag tgtatcatat gccaagtacg ccccctattg acgtcaatga cggtaaatgg 900
cccgcctggc attatgccca gtacatgacc ttatgggact ttcctacttg gcagtacatc 960
tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacat caatgggcgt 1020
ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt caatgggagt 1080
ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactc cgccccattg 1140
acgcaaatgg gcggtaggcg tgtacggtgg gaggtctata taagcagagc tcgtttagtg 1200
aaccgtcaga tcgcctggag acgccatcca cgctgttttg acctccatag aagacaccgg 1260
gacaattcga gctcggtacc gtcgacgcca ccatgagagc gaaggagaag tatcagcact 1320
tgtggagatg ggggtggaaa tggggcacca tgctccttgg gatattgatg atctgtagtg 1380
ctacagaaaa attgtgggtc accgtctatt atggggtacc tgtgtggaag gaagcaacca 1440
ccactctatt ttgtgcatca gatgctaaag catatgatac agaggtacat aatgtttggg 1500



ccacacatgc ctgcgtaccc acagacccca acccacaaga agtagtattg gcaaatgtga 1560
cagaaaattt taacatgtgg aaaaatgaca tggtagaaca gatgcacgag gacataacca 1620
gtctatggga tcaaagccca aagccacgcg taaaattaac cccactctgt gtcagtttaa 1680
agcgcactga tctgaagaat gatactzaaca ccaacagtag cagcgggaga atgataaegg 1740
agaaaggaga gataaaaaac tgctctctca atatcagcac aagcacaaga gataaggtgc 1800
agaaagaaca tgcattctct cataaactcg atacagcacc aatagataac accagctata 1860
ggctgataag ctgtaacacc tcagccatca cacaggcccg cccaaaggta ccccttgagc 1920
caattcccat acatcattgt gccccggctg gttttgcgac tctaaaatgt aacaacaaga 1980
cgttcaatgg aacaggacca cgtacaaatg tcagcacagc acaacgcaca cacggaacca 2040
ggccagtagt accaactcaa ctgctgtcaa atggcagccu agcagaagaa gaegtagtaa 2100
tcagaccngc caacttcaca gacaatgcta aaaccacaac agtacagctg aacacatctg 2160
cagaaaccaa ttgtacaaga cccaacaaca atacaagaaa aagtatccgc atccagaggg 2220
gaccagggag agcatttgtt acaataggaa aaacaggaaa catgagacaa gcacattgta 2280
acatcagtag agcaaaatgg aacgccactt taaaacagac agccagcaaa ttaagagaac 2340
aactcggaaa taataaaaca ataacctcta agcaaccctc aggaggggac ccagaaattg 2400
taacgcacag ttttaattgt ggaggggaac ttttctaczg taattcaaca caactgttta 2460
acagtacttg gtttaatagt actcggagca ccgaagggtc aaataacact gaaggaagcg 2520
acacaatcac actcccatgc agaacaaaac aacttataaa cacgtggcag gaagtaggaa 2580
aagcaatgta tgcccctccc atcagtggac aaatcagacg tccaccaaat attactgggc 2640
tgctaccaac aagagatggt ggcaataaca acaacgggcc cgagatctec agacceggag 2700
gaggcgacat gagggacaat tggagaagtg aactatataa atacaaagta gtaaaaatcg 2760
aaccattagg agtagcaccc accaaggcaa agagaagagc ggegcagaga gaaaaaagag 2820
cagcgggaac aggagcttcg ctccttgggt ccu-gggagc agcaggaagc accatgggcc 2880
gcacgtcaat gacgctgacg gtacaggcca gacaatcacc gtccgacata gtgeagcage 2940
agaacaattt gccgagggct attgaggcgc aacagcacc- gccgcaactc acagtctggg 3000
gcatcaaaca gctccaggca agaatcctgg ctgnggaaag atacctaaag gatcaacagc 3060
tcctggggat ctggggttgc cccggaaaac tcacttgcac cactgctgtg ccctggaatg 3120
ctagttggag caacaaatct ctggaacaga tttggaataa catgacctgg atggagtggg 3180
acagagaaat caacaattac acaagctcaa tacactccc~ aattgaagaa tcgcaaaacc 3240
agcaagaaaa gaatgaacaa gaattactgg aattagataa atgggcaagt ttgtggaatt 3300
ggtttaacac aacaaattgg ctgtggtaca taaaattac" cataatgata gtaggaggct 3360
cggtaggttt aagaatagtt ctcgctgtac ttectatagc gaacagagtt aggcagggat 3420
acccaccatt accgtttcag acccacctcc caatcccgag gggacccgac aggeccgaag 3480
gaacagaaga agaaggtgga gagagagaca gagacaga~c cacccgatta gtgaaeggat 3540
cctcagcact tatctgggac gacccgcgga gcctgtgcc- cctcagccac caccgcttga 3600
gagacttact cttgattgta acgaggattg tggaaccccz gggaegcagg gggtgggaag 3660
ccctcaaata ttggtggaat cccccacagt ac-ggagcca ggaactaaag aacagtgctg 3720
ttaacttgct caatgccaca gccatagcag cagctgaggg gacagacagg gtcauagaag 3780
cattacaagc agctcataga gctattcgcc acacacctag aagaataaga cagggectgg 3840
aaaggatttt gctataagat gggtggcaag tggtcaaaaa gtagtgtgat tggatggcct 3900
gccgtaaggg aaagaatgag acgagccgag ccagcagcag atggggtggg agcagtatct 3960
cgagatctag actagaacta gcctcgaccc agacatgaca agacacactg atgagcttgg 4020
acaaaccaca actagaatgc agtgaaaaaa atgctttac- cgegaaactc gegatgetae 4080
tgctttattt gtaaccatta caagctgcaa taaacaagc" aacaacaaca attgeattea 4140
ttttatgttt caggttcagg gggaggtgcg ggaggtcc-c taaagcaagt aaaacctcta 4200
caaatgtggt atggctgatt atgatcctgc ctcgcgcgcc ccggtgatga eggtgaaaac 4260
ctctgacaca tgcagctccc ggagacggcc acagcttgcc tgcaagcgga tgeegggage 4320
agacaagccc gtcagggcgc gtcagcgggt gceggcgggt gteggggege agccatgacc 4380
cagccacgta gcgatagcgg agtgtatact ggctcaacta tgeggcatea gagcagattg 4440
tactgagagt gcaccatacg ccgggccgcg ttgctggcgt ttttccatag gctccgcccc 4500
cctgacgagc accacaaaaa ccgacgccca agtcagaggc ggcgaaaccc gacaggacta 4560
caaagatacc aggcgcttcc ccctggaagc tccctcgcgc gctctcctgt cccgaccctg 4620
ccgcttaccg gatacccgtc cgccctcctc ccttcgggaa gcgtggcgct ttctcatagc 4680
ccacgctgta ggtatctcag ctcggtgtag gtcgctcgcc ccaagctggg ctgtgtgcac 4740
gaaccccccg ttcagcccga ccgctgcgcc ttatccggca actategtet tgagtccaac 4800
ccggtaagac acgacttacc gccactggca gcagccaccg gtaacaggat tagcagagcg 4860
aggtatgtag gcggtgctac agagttcccg aagtggcggc etaactaegg ctacactaga 4920



aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt 4980
agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag 5040
cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatctttcc tacggggtct 5100
gacgctcagt ggaacgaaaa ctcacgtcaa gggattttgg tcatgagatt atcaaaaagg 5160
atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat 5220
gagcaaactn ggtctgacag ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc 5280
tgtctatttc gttcacccat agttgcctga ctccccgtcg tgtagacaac tacgacacgg 5340
gagggcttac catctggccc cagtgctgca atgataccgc gagacccacg ctcaccggct 5400
ccagattcat cagcaataaa ccagccagcc ggaagggccg agcgcagaag tggtcctgca 5460
actttatccg cctccaccca gtctattaat tgttgccggg aagctagagt aagtagttcg 5520
ccagttaaca gtttgcgcaa cgttgttgcc attgctacag gcaccgtggt gtcacgctcg 5580
tcgtttggta tggcttcatt cagctccggt tcccaacgac caaggcgagt tacatgatcc 5640
cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag 5700
ttggccgcag tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg 5760
ccatccgcaa gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag 5820
tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac gggataatac cgcgccacat 5880
agcagaactt taaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg 5940
atcttaccgc tgttgagatc cagttcgatg taacccactc gtgcacccaa ctgatcctca 6000
gcatctttta ctttcaccag cgtttctzggg tgagcaaaaa caggaaggca aaatgccgca 6060
aaaaagggaa taagggcgac acggaaatgt tgaatactca tactcttcct ttttcaatat 6120
tattgaagca tttatcaggg ttattgtctc atgagcggac acatatttga atgcatttag 6180
aaaaataaac aaataggggt tccgcgcaca ttztccccgaa aagcgccac 6229



<210> 11 
<211> 860 
<212> DNA 

<213> Human immunodeficiency virus 
<220> 

<221> misc_feature 
<222> (1)..(860) 

<223> PI-932 original sequence V1-V2-V3-loop 



<400> 11 

tgtgtaccca cagaccccaa cccacaaaag gtagtattgg aaaatgtgac agaaaatttt 60
aacatgtgga aaaatgacat ggtagaacag atgcatgagg acacaatcaa tttatgggat 120
caaagcctaa agccatgtgt aaaactaacc ccactctgtg ttactttaaa ttgcactgat 180
gctgatttaa attgcaataa tactgattta aattgcacta aagctaattt ggggaaaaat 240
actcataaca atactattag tgggaaaata atagagaaag tagaaataaa aaactgctct 300
ttcaaggtca ccacaggcat aagggataag atgcaaaaag aatatgcact tttgaatzaaa 360
cttgatacag taccaataga taatgataag aataatacta actttatatt gataagttgt 420
aacacctcga ccattacaca ggcctgtcca aaggtatcct tcgagccaat tcccatacat 480
ttttgtgccc cggctggttt tgcgattcta aagtgtaatg aaaagagcta cagtggaaaa 540
ggaccatgta aaaatgtcag cacagtacaa tgtacacatg gaattaggcc agtagtgtca 600
actcaactgc tgttgaatgg cagtctagca gaaaaagaag tagtaattag atctgagaat 660
ttcacagaca atgctaaaac cataatagta cagctgaagg aatctgtaaa cactacttgt 720
ataagacccc acaacactgt aacagacagg atacatatag ggccagggag atcatttcat 780
acaacaagaa aaataaaagg agatataaga caagcacatt gtagccttag gagaaaagat 840
tggaataaca ctttacaaga 860



<210> 12 

<211> 870 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of the artificial sequence: PI-932 
gene cassette, comprising the cleavage sites for 
restriction enzymes BspTI, PstI, BclI, EcoRI, 
BglII, PvuII, XbaI, NheI 



<400> 12 

tgtgtaccca cagaccccaa cccacaaaag gtagtattgg aaaatgtgac agaaaattnt 60 

aacatgtgga aaaatgacat ggtagaacag atgcatgagg acataatcaa tttatgggat 120 

caaagcc-ca agccatgtgt aaaactaacc ccactctgtg ttactttaaa ttgcactgat 180 

gctgatt-aa attgcaataa tactgattta aattgcacta aagctaattt ggggaaaaat 240 

actcataact gcagtattag tgggaaaata atagagaaag tagaaataaa aaactgctct 300 

ttcaaggtca ccacaggcat aagggataag atgcaaaaag aatatgcact tttgaataaa 360 

cttgatatag taccaataga taatgataag aataatacta actttatatt gataagttgt 420 

aacacctcgg tgatcacaca ggcctgtcca aaggtatcct ttgagccaat tcccatacat 480 

ttttgtgccc cggctggttt tgcgattcta aagtgtaatg aaaagagtta cagtggaaaa 540 

ggaccatgea aaaatgtcag cacagtacaa tgtacacatg gaattcggcc agtagtgtca 600 

actcaaccgc tgttgaatgg cagtctagca gaaaaagaag tagtaattag atctgagaat 660 

ttcacagaca atgctaaaac cataatagta cagctgaagg aacctgtaaa cattacttgt 720 

ataagacccc acaacaccgt aacagacagg atacatatag ggccagggag atcatttcat 780 

acaacaagaa aaataaaagg agacataaga caagcacatt gtagcctttc tagaaaagat 840 

tggaataaca ctttacaaga gatagctagc 870 



